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Abstract 


There  are  two  distinct  formalizations  for  reasoning  from  observations  to  ex¬ 
planations,  as  in  diagnostic  tasks.  The  consistency  based  approach  treats 
the  task  as  a  deductive  one,  in  which  the  explanation  is  deduced  from  a 
background  theory  and  a  minimal  set  of  abnormalities.  In  the  other  treat¬ 
ment,  based  on  abduction,  the  explanations  are  considered  to  be  sentences 
that,  when  added  to  the  background  theory,  account  for  the  observations. 
We  show  that  there  is  a  close  connection  between  these  two  formalizations. 
Starting  with  a  causal  theory,  explanations  can  be  generated  either  by  ab- 
ductive  reasoning,  or  by  adding  closure  axioms  and  minimizing  causation 
within  a  deductive  framework.  The  latter  method  is  strictly  stronger  than 
the  former,  but  requires  full  knowledge  of  causation  in  a  domain. 
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1  Introduction 


Reasoning  to  the  best  explanation  is  common  task  in  many  areas  of  Artificial 
Intelligence.  One  of  the  clearest  examples  is  diagnosis,  in  which  one  reasons 
from  observations  such  as  patient  symptoms  to  their  underlying  causes,  a 
disease  or  physiological  malfunction.  In  the  literature,  there  are  two  funda¬ 
mentally  different  formalizations  of  this  task  [11;  9].  In  one,  the  process  of 
finding  a  cause  is  treated  as  a  straightforward  abductive  task.  Representative 
of  this  approach  is  the  set-covering  model  of  diagnosis  [12],  which  assumes 
two  disjoint  sets,  d  a  set  of  disorders,  and  m  a  set  of  manifestations.  Disor¬ 
ders  are  assumed  to  “cause”  manifestations,  represented  by  a  relation  d  x  m. 
The  problem  of  diagnosis  is  recast  as  the  problem  of  finding  a  minimal  cover 
of  observed  manifestations  m '  C  m,  that  is,  a  minimal  subset  of  d  that  causes 
m! . 

The  competing  formalization,  the  consistency  based  approach,  is  best 
represented  by  Reiter’s  theory  of  diagnosis  from  first  principles  [14].  In  this 
theory,  the  functionality  of  a  system  containing  a  finite  number  of  compo¬ 
nents  is  characterized  by  a  set  of  first-order  sentences,  the  background  theory. 
The  special  predicate  a b(c)  is  used  to  state  that  the  component  c  is  abnormal 
or  not  functioning  correctly.  The  current  behavior  of  the  system  is  given  by 
a  set  of  observation  sentences.  A  diagnosis  of  the  behavior  is  a  minimal  set 
of  abnormality  assumptions  that  is  consistent  with  the  observations  and  the 
background  theory. 

These  two  formalizations  seem  fundamentally  different.  The  abductive 
approach  looks  for  a  set  of  causes  that  will  imply  the  observations;  the  con¬ 
sistency  based  approach  looks  for  a  set  of  abnormality  assumptions  that 
are  consistent  with  the  observations.  Nevertheless  there  is  a  connection  be¬ 
tween  the  two:  Reiter  showed  how  to  express  the  set-covering  model  within 
his  framework.  Recently,  Console  |2]  and  Poole  [9]  have  shown  that  either 
formalization  can  be  used  in  restricted  settings  to  compute  the  same  expla¬ 
nations  for  diagnosis.  In  the  abductive  framework,  the  domain  theory  has 
axioms  that  relate  causes  and  their  effects,  e.g.,  c,-  D  c  would  be  used  to 
say  that  the  effect  e  is  a.  result  of  cause  c,.  A  corresponding  consistency 
based  theory  is  created  by  adding  closure  axioms  stating  that,  the  only  way 
to  achieve  an  effect  is  by  the  set  of  causes  given  (e  D  c-j  V  Co  V  •■•).  The 
closure  axioms  are  local  in  that  they  are  easily  derived  by  looking  a.t  all  the 
implications  that  have  a  common  head  atom.  The  explanations  computed 
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by  the  two  methods  are  the  same,  as  long  as  the  domain  theory  contains  just 
horn-clause  implications  from  causes  to  effects,  and  is  acyclic. 

This  result  applies  to  diagnostic  tasks  that  require  explanations,  that  is, 
the  unexpected  observations  must  be  predicted  or  explained  from  the  as¬ 
sumed  malfunctions.  In  the  literature,  explanatory  diagnosis  is  usually  sig¬ 
nalled  by  the  presence  of  fault  models  [15;  4].  Reiter’s  framework  may  also  be 
used  for  a  weaker  form  of  diagnosis,  which  could  be  called  excusing  diagnosis: 
identify  components  that,  if  malfunctioning,  would  cancel  or  excuse  predicted 
normal  behavior  of  the  system  that  conflicts  with  the  observations.  Here  we 
look  only  at  the  case  of  explanatory  diagnosis  (and  causal  explanation  in 
general),  since  excusing  diagnosis  has  no  analog  in  the  abductive  framework. 

The  restrictions  on  the  domain  theory  for  the  Console/Poole  result  are 
very  tight;  in  particular,  there  can  be  no  correlation  information  (e.g.,  that 
two  causes  are  mutually  exclusive,  or  that  one  effect  is  the  negation  of  an¬ 
other)  or  uncertainty  (e.g.,  a  cause  implying  a  disjunction  of  effects).  In 
this  paper  we  will  examine  the  connection  between  abduction  and  closure 
in  the  setting  of  explanation  in  general  causal  models,  allowing  correlations, 
uncertainty,  and  acyclicity  in  the  causal  structure.  We  answer  the  following 
questions. 

•  Is  there  a  notion  of  explanatory  closure  that  is  appropriate  for  the  more 
general  domain  theory?  Is  there  an  equivalent  local  closure? 

•  Is  consistent  explanatory  closure  of  a  general  domain  theory  possible? 

•  When  consistent  closure  is  possible,  does  minimization  of  causes  in  the 
closed  theory  compute  the  same  explanations  as  does  abduction  in  the 
original  theory? 

There  are  both  positive  and  negative  results.  With  an  appropriate  notion  of 
explanatory  closure,  given  certain  technical  conditions,  the  consistency  based 
approach  will  compute  the  same  explanations  as  the  abductive  approach. 
However,  the  utility  of  the  former  method  is  open  to  question,  since  local 
closure  will  no  longer  suffice  for  explanatory  closure:  there  seems  to  be  no 
wav  to  close  the  domain  theory  other  than  by  computing  all  explanations. 
Further,  the  consistency  based  method  is  strictly  stronger  than  the  abductive 
one  in  explanatory  diagnosis  tasks,  and  the  answers  it  produces  may  have 
elements  that  are  not  relevant,  to  a  causal  explanation. 


2  Simple  Causal  Theories 

We  are  interested  in  domains  in  which  there  is  a  concept  of  cause  and  effect. 
Much  of  our  commonsense  view  of  the  world  can  be  cast  into  this  form.  Typ¬ 
ical  here  is  reasoning  about  actions  or  events  and  their  results,  usually  for¬ 
malized  in  the  situation  calculus  or  some  variant  [7].  Other  domains  include 
medical  diagnosis  with  diseases  as  causes,  symptoms  as  effects;  mechanical  or 
electrical  systems  with  components  and  inputs  as  causes,  outputs  as  effects; 
and  planning  domains  with  plans  as  causes,  actions  as  effects. 

While  there  is  a  great  deal  of  complexity  and  controversy  in  defining 
causation,  for  this  paper  most  of  these  problems  can  be  bypassed  because 
we  are  interested  in  a  formal  representation  of  the  simplest  aspects  of  causal 
consequence,  given  by  the  following  definition. 

DEFINITION  2.1  Let  C  be  a  first-order  language.  A  simple  causal  theory  is 
a  tuple  (C,  E,  S)  where 

•  C,  a  set  of  atomic  sentences  of  C,  is  the  causes. 

•  E,  a  set  of  sentences  of  C,  is  the  effects. 

•  E,  a  set  of  sentences  of  C.  is  the  domain  theory. 

The  set  C  contains  those  atomic  propositions  which  represent  the  possible 
causative  agents  of  the  domain.  If  we  are  looking  for  an  answer  to  the 
question  of  “what  caused  e?”,  then  an  acceptable  answer  is  some  subset  of 

C'.1 

Effects  E  are  those  aspects  of  the  domain  that  we  might  observe  and 
about  which  we  w'ant  to  know  the  cause.  Note  that  E  and  C  need  not  be 
disjoint;  an  observed  cause  ma.y  require  no  further  explanation. 

The  domain  theory'  E  contains  information  about  the  relation  between 
causes  and  effects.  For  example,  in  the  situation  calculus  we  might  take  C 
to  be  occurrences  of  actions,  E  to  be  properties  of  the  final  sl  ate,  and  E  to 
hold  information  about  the  initial  state  and  the  way  in  which  actions  affect 
properties  of  situations. 

1  Allowing  only  atoms  simplifies  the  analysis,  but  is  not  restrictive,  since  we  can  include 
equivalences  such  as  c  =  <f>,  where  <j>  is  a  complex  sentence. 
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Here  is  a  simple  causal  theory  that  will  be  used  as  an  example  in  the  rest 
of  the  paper;  a  graphical  presentation  appears  in  Figure  1.  The  intended 
meaning  of  the  predicates  should  be  obvious  from  their  names. 

Causes:  rain,  sun,  warm,  sprinkler 

Effects:  wet-fawn,  wet-road 

Domain  theory:  rain  D  wet-roacl,  rain  D  wet-lawn,  sun  -  -train 

sprinkler  D  wet-lawn,  sun  A  warm  3  sprinkler 

A  notational  convention:  a  finite  set  of  sentences  will  often  be  taken  as  a 
conjunction,  e.g.,  if  A  and  B  are  such  sets,  we  write 

A  V  B  for  (a;  A  <*2  •  •  •)  V  (6j  A  &2  •  ■  •) 

->A  for  -’(«.)  A  «2  •  ■  •)  . 
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3  The  Abductive  Approach 

Given  a  simple  causal  theory,  the  problem  of  reasoning  from  observations  to 
causes  can  be  expressed  formally  using  abduction.  The  account  of  logical 
abduction  we  give  here  draws  on  ideas  already  present  in  the  literature  (e.g., 

[9])- 

DEFINITION  3.1  Let  (C,E,  S)  be  a  simple  causal  theorxj.  An  explanation 
of  a  set  of  obsei'vations  O  C  E  is  a  finite  set  ACC  such  that 

•  A  is  consistent  with  S. 

•  sudt-o. 

•  A  is  subset-minimal. 

If  0  has  a  nonzero  finite  number  of  explanations,  then  the  cautious 
explanation  is  their  disjunction:  Vi  A,-. 

Remarks.  /I  must  be  a  minimal  set  of  members  of  C\  by  minimal  is  meant 
there  is  no  other  explanation  that  is  a  proper  subset.  If  these  atoms  consti¬ 
tute  valid  causes  of  the  observed  effects,  and  if  there  is  an  explanation  that 
contains  fewer  causes,  it  should  Ire  preferred.  Other  than  this  we  say  noth¬ 
ing  about  preferences  among  multiple  explanations.  It  is  obvious  that  often 
such  preferences  will  be  required  for  reasoning,  e.g.,  we  may  want  the  most 
specific  explanation,  or  the  most  normal  (where  we  partition  causes  into  ones 
that  normally  occur  and  ones  that  do  not),  or  the  A’-est,  where  X  is  some 
measure  on  explanations.  The  preference  could  be  expressed  mathematically 
by  a  partial  order  on  the  subsets  of  C .  Since  such  an  order  will  be  closely 
related  to  the  domain  of  application,  and  we  have  no  way  of  making  any 
general  statements  about  the  order,  we  omit  it  from  further  consideration 
here. 

In  a  given  problem  domain,  we  may  be  interested  in  the  best  explana¬ 
tion,  or  the  cautious  explanation,  or  even  any  (satisficing)  explanation.  For 
example,  if  we  want  to  predict  the  possible  states  of  the  world  under  a  series 
of  events,  then  the  cautious  explanation  might  be  most  appropriate,  while 
tasks  like  plan  recognition  usually  require  the  best  explanation.  And  for  some 
problems  there  is  no  ordering  of  solutions,  and  any  one  would  be  acceptable. 


Finally,  it  is  possible  that  one  explanation  A  will  imply  another  A'  in  a 
simple  causal  theory.  For  example,  sun  and  warm  implies  sprinkler  in  the 
sample  theory.  More  generally,  let  Ai  V  A2  V  •  •  •  V  An  be  any  disjunction 
of  explanations  for  0]  Ax  is  independent  for  0  in  the  theory  S  if  £  U  A\  ^ 
A2  V  A3  V  •  •  •  V  An. 

Using  the  example  causal  theory  of  the  previous  section,  there  are  three 
explanations  of  0  =  {wet-lawn},  namely  {rain},  {spi'inA'fei-},  and  {sun,  warm} 
The  cautious  explanation  is  rain  V  sprinkler  V  (sun  A  warm),  which  simpli¬ 
fies  in  the  domain  theory  to  rain  V  sprinkler.  The  explanation  sun  A  warm 
is  not  independent,  since  it  implies  sprinkler.  The  observation  set  0  = 
{wet-roac/,  wet-lawn}  has  the  single  explanation  {rain},  which  is  also  its  cau¬ 
tious  explanation. 


8 


4  The  Consistency  Based  Approach 

The  consistency  based  approach  to  explanation  is  fundamentally  different 
from  abduction  in  its  formal  specification.  We  start  with  a  structure  (C,  E,  II) 
similar  to  a  simple  causal  theory,  except  that  the  domain  theory  II  now  gives 
necessary  causal  conditions  for  the  occurrence  of  an  effect;  we  comment  on 
this  more  below.  We  call  this  theory  an  inverse  causal  theory. 

DEFINITION  4.1  Let  (C,  E}  n)  be  an  inverse  causal  theory,  and  0  a  subset 
of  E  (the  observation  set).  A  denial  set  for  O  is  a  maximal  subset 
D  C  C  such  that 

II  U  0  U  {-‘d  |  d  6  D}  is  consistent. 

The  complement  of  the  denial  set  with  respect  to  C  (C  —  D)  is  called  a 
diagnosis  for  0? 

Because  a  denial  set  is  subset-maximal,  the  diagnosis  is  a  deductive  conse¬ 
quence  of  the  domain  theory,  the  observations,  and  the  denial  set,  as  proven 
in  [14]: 

IWOU{->d\deD}\-C-D .  (1) 

The  difference  between  the  consistency  based  approach  and  abduction  is 
twofold.  First,  the  form  of  inference  is  distinct:  rather  than  abducing  causes 
that  imply  the  observations  0  given  the  domain  theory  S,  the  consistency 
approach  tries  to  minimize  the  extent  of  the  causation  set  C  by  denying  as 
many  of  its  elements  as  possible.  Second,  these  methods  encode  knowledge 
of  the  domain  differently:  in  the  abductive  framework,  there  are  implications 
from  the  causes  to  the  effects,  while  in  the  consistency  based  systems,  the 
most  important  information  seems  to  be  the  implication  from  observations 
to  possible  causes.  For  example,  in  reconstructing  the  set-covering  model  of 
diagnosis,  Reiter  [14]  uses  axioms  of  the  form: 

OBSERVED(m)  D  PRESENT^/])  V  •  •  •  V  PHESENT(r/„), 

-'In  the  original  paper  ([14]),  a  diagnosis  was  defined  differently  and  then  proven  to  be 
the  complement  oF  the  denial  set. 
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where  m  is  the  observed  symptom  and  d,  are  diseases  that  cause  the  symp¬ 
tom.  These  axioms  give  necessary  conditions  for  the  observation,  namely, 
that  one  of  a  set  of  diseases  be  present.  An  inverse  causal  theory  contains 
statements  of  this  form.  In  fact,  under  certain  conditions  on  the  domain 
theory,  diagnoses  and  explanations  coincide. 

Theorem  4.1  (Console,  Poole)3  Let  (C,E,Yl)  be  a  simple  causal  the¬ 
ory  over  a  propositional  language,  with  S  a  set  of  nonatomic  definite 
clauses  whose  directed  graph  is  acyclic.  Let  C  be  a  set  of  atoms  that  do 
not  appear  in  the  head  of  any  clause  ofE,  and  E  any  set  of  atoms.  Let 
fl  be  the  Clark  completion  [l]  of  E.  Then  the  diagnoses  of  (C,  E,  II) 
are  exactly  the  explanations  of  (C,E,E). 

The  simple  causal  theory  of  Figure  1  does  not  satisfy  the  conditions,  because 
it  contains  the  equivalence  sun  =  ~>rain,  and  sprinkler  is  a  cause  that  appears 
as  the  head  of  a  clause.  If  we  eliminate  these  anomalies,  then  the  Clark 
completion  of  the  domain  theory  is: 

wet-road  =  rain 

wet-lawn  =  rain  V  sprinkler  (2) 

sprinkler  =  warm  A  sun 

The  diagnoses  of  wet-lawn  are  {rain}  and  {sun,  warm};  the  explanation 
{sprink/ej-}  is  missing. 

For  more  complicated  domain  theories,  Clark  completion  does  not  give 
the  required  closure  over  explanations.  If  the  theory  has  cycles,  for  example 
{«  3  6,  a'  3  b,  b  3  ft},  then  the  completion  will  only  pick  out  a.  subset  of  the 
explanations  {in  this  case,  b  =  ft).  If  there  is  disjunction  in  the  head  of  a 
clause,  the.  completion  is  undefined. 

In  the  next  few  sections  we  will  extend  the  scope  of  Theorem  4.1  by 
considering  a.  more  general  notion  of  completion  for  a  simple  causal  theory, 
that  of  explanatory  closures. 


3Neither  of  these  authors  states  the  theorem  in  this  form,  although  Poole  [10]  is  close. 
It  is  clear  that  the  theorem  follows  from  their  results. 
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5  Explanatory  closures 

Let  (C,  E ,  E)  be  a  simple  causal  theory,  and  suppose  g  €  E  has  a  cautious 
explanation  V;  A Now  consider  the  statement 

g  D  Aj  V  A2  V  ■■■  V  An  ,  (3) 

where  we  understand  each  A,-  to  be  the  conjunction  of  its  elements.  This 
expression  says  that  whenever  g  is  present,  it  must  have  been  caused  by  one 
of  the  Ai]  we  call  this  expression  the  explanatory  closure  of  g  with  respect  to 
the  simple  causal  theory  { C,E ,  E);  it  is  abbreviated  7(g).  If  the  explanatory 
closures  of  all  effects  E  exist,  then  the  theory  (C,  E,  II)  formed  by  adding 
the  closures  to  E  is  called  the  closure  of  (C,E,  S). 

By  forming  the  closure  of  a  causal  theory  we  can  deduce  the  cautious 
explanation  from  any  given  effect.  One  immediate  question  is  whether  we 
should  add  something  stronger  or  weaker  to  close  the  theory.  If  we  add 
a  stronger  closure,  then  we  have  excluded  some  original  explanation  from 
consideration;  e.g.,  if  the  explanatory  closure  is  g  3  a1Va2,  and  we  use  g  D  a  1 
instead,  then  «]  is  the  only  explanation  for  g.  On  the  other  hand,  suppose 
instead  we  add  <7  3  («i  V  «2  V  S)  for  some  arbitrary  sentence  S.  If  we  try  to 
derive  explanations  by  minimizing  causes,  then  since  _|(«i  V  «2)  is  consistent 
with  the  closure,  we  could  assume  it,  and  derive  S  as  the  “explanation”  for 
<7,  which  is  certainly  not  intended. 

Another  question  is  whether  explanatory  closures  are  always  consistent 
with  the  original  causal  theory,  and  if  so,  whether  the  original  explanations 
remain  unchanged.  Unfortunately,  the  answer  to  both  parts  of  this  question 
is  “no.” 

EXAMPLE  5.1  Let  ({«i,a2,03},{<7i,(/2,<73},E)  be  a.  simple  causal  theory, 
with  E  equal  to  the  conjunction  of 

«i  A  a2  3  g\  a,\  A  a2  3  #2  V  g3  ~'U\  V  -i«2  V  -<«3 

«2  A  a3  3  g2  «2  A  «3  3  </i  V  g3  g\  V  (/2  V  g3 

«3  A  a]  3  g3  a3  A  a,!  3  gi  V  g? 

The  closures  of  this  theory  are 

</i  3  fi]  A  0-2 
g-2  3  <h  A  03 
.<7.3  3  «3  A  <t\  . 
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It  is  easy  to  show  that  the  conjunction  of  these  closures  is  inconsistent 
with  E.4 

The  technical  conditions  for  inconsistency  are  somewhat  complicated,  and  it 
takes  some  work  to  create  a  causal  theory  that  will  have  inconsistent  closures; 
e.g.,  the  example  of  Figure  1  can  be  consistently  closed,  but  it  was  not  origi¬ 
nally  designed  with  this  property  in  mind.  The  necessary  conditions  involve 
interacting  effects  and  causes  such  that  in  the  causal  theory  at  least  one  of 
the  effects  is  true,  and  one  of  the  causes  false.  The  following  proposition 
states  this  more  precisely. 

PROPOSITION  5.1  Let  {7  (</,-)  I  0  <  i  <  ??.}  be  a  set.  of  closures  for  (C ,  E,  S). 
For  each  i  <  n,  let  p;  be  either  yx  or  ->Ai,  where  is  any  explanation 
for  gi .  Each  sentence 


Px  V  p2  V  •  •  •  V 

must  be  a  theorem  of  S  for  these  closures  to  be  inconsistent  with  S. 

Proof.  For  the  closures  to  be  inconsistent,  Vi  ~ 1  ( T ( </»' ) )  must  be 
a  theorem  of  E.  We  have: 

V,-  ;))  =  V;(ff«  A  ~'Egi) 

=  v,- [<Ji  a  ->A}  A  ->Aj  •  •  •) 

where  Egi  is  the  cautious  explanation  for  and  the  /1,’s  are 
all  explanations  for  it.  The  proposition  follows  by  tautological 
consequence. 

We  now  turn  to  the  question  of  how  adding  closures  can  modify  explana¬ 
tions. 

Example  5.2  Let  «.i}, {</] ,</•>, #3}, S)  be  a  simple  causal  theory, 

with  E  equal  to  the  conjunct  ion  of 

'Ll.  was  suggested  by  a  reviewer  that,  the  sentence  t/\  D  («i  A  «•_>)  V  (re_>  A  0.3  A  — 1//3)  V 
(cr  1  A  (13  A -1173)  and  similar  ones  for  the  other  effects  be  used;  these  closures  are  consistent 
with  the  domain  theory.  However,  as  noted  above,  this  would  generate  an  anomalous 
explanation:  by  asserting  -if/],  we  derive  rn>  A o:i.  which  is  not  an  explanation  for  e/1 . 
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a i  D  9i  ->(-£1  V  -><z3 
«i  D  92  as  D  9i  V  g2 

«3  D  93 
«4  D  .93 

The  closures  of  this  theory  are 

91  «i 

92  D  «i 

93  D  «3  V  ££'i  • 

If  the  first  two  closures  are  added  to  S,  a\  becomes  true,  013  becomes 
false,  and  the  only  explanation  for  g3  is  a,(. 

This  example  shows  that  some  causes  may  become  true  or  false,  thus  modi¬ 
fying  the  available  explanations.  However,  new  explanations,  which  are  not 
subsets  of  old  ones,  do  not  arise  from  the  addition  of  closures. 

Proposition  5.2  Let  (C,E,E)  be  a  simple  causal  theory,  and  {7(9,)}  a 
set  of  explanatory  closures  with  respect  to  it.  Suppose  II  =  £  U  {7(9,)} 
is  a  consistent  set.  For  an  arbitrary  effect  g,  every  explanation  of  g 
w.r.t.  II  is  a  subset  of  some  explanation  of  g  w.r.t.  E. 

Proof.  Assume  A  is  an  explanation  for  g  w.r.t.  II,  but  there 
is  no  A'  D  A  such  that  A'  is  an  explanation  for  g  w.r.t.  E.  Us¬ 
ing  a  technique  similar  to  that  of  Proposition  5.1,  the  following 
must  be  theorems  of  S,  where  each  p,  is  either  g,  or  ->j4;  for  any 
explanation  A;  of  gp 

Pi  V  p-2  V  •  •  •  V  pn  V  ->A  V  g  . 

Choosing  each  p;  to  be  -'Ai,  this  is  a  sentence  which  contradicts 
the  original  assumption. 

5.1  Augmented  Domain  Theories 

Rather  than  trying  to  determine  if  a  causal  theory  has  a  consistent  closure, 
we  might  find  it  useful  to  modify  the  theory  so  that  it  does.  The  simplest 
way  to  do  this  is  to  add  an  escape  cause  for  each  efFect:  a  new  cause  r,-  is 
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included  in  C  for  each  g,-,  and  the  sentence  r,-  D  g ,  is  added  to  E.5  The 
new  causes  are  sufficiently  isolated  from  the  original  domain  theory  so  that 
inconsistency  cannot  result.  In  effect,  the  closure  conditions  no  longer  force 
one  of  the  original  explanations  for  g-x  to  be  true,  .since  r,-  is  an  alternative. 
Further,  augmented  theories  do  not  change  their  original  explanations  at  all 
when  closures  are  added. 

Proposition  5.3  Let  {C\E,T/)  be  a  simple  causal  theory  formed  from 
{ C ,  E ,  E)  by  adding  r,  to  C  and  r;  D  g,  to  E  for  each  g{  £  E:  call  this  an 
augmented  causal  theory.  Suppose  that  {7(4, )}  is  a  set.  of  explanatory 
closures  with  respect  to  the  augmented  theory,  and  let  II  =  S'Ub  (#)}• 
Then  II  is  consistent,  and  for  an  arbitrary  effect  g,  a  subset.  A  C  C 
is  an  explanation  of  g  w.r.t.  II  if  and  only  if  it  is  an  explanation  of  g 
w.r.t.  S. 

Proof.  By  Proposition  5.1,  if  the  closure  of  E'  is  to  be  inconsis¬ 
tent,  the  following  must  be  theorems  of  S',  where  each  p,  is  either 
g;  or  ->n: 


Pi  V  p2  V  •  *  •  V  pn  . 


Because  the  only  expressions  containing  r,-  are  of  the  form  rt-  D  </,■ , 
the  above  sentences  are  theorems  of  E'  only  if  there  are  corre¬ 
sponding  theorems  of  E  with  each  replaced  by  “yy,-.  This  is 
impossible,  since  such  a.  set  is  unsatisfiable. 

Assume  A  C  C  is  an  explanation  for  g  w.r.t.  II,  but  not  w.r.t. 
E.  By  reasoning  similar  to  that  in  the  proof  of  Proposition  5.3, 
the  following  must  be  theorems  of  Eh  where  each  j),  is  either  g, 
or  -77: 


Pi  V  p2  V  •  -  •  V  pn  V  -1 A  V  g  . 

By  tautological  consequence,  these  sentences  imply  /I  D  g,  con¬ 
tradicting  the  initial  assumption. 

5 Escape  causes  are  the  same  idea  as  the  unknown  faults  of  [•!:  15]. 
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5.2  Local  Closure 


Cautious  explanations  for  a  proposition  g  are  defined  by  reference  to  the 
entire  contents  of  the  causal  theory  S.  Is  there  a  way  of  deriving  these 
explanations  in  a  local  manner,  that  is,  by  looking  only  at  the  sentences 
of  S  in  which  g  occurs?  From  Theorem  4.1,  Clark  completion  works  for 
a  restricted  language.  But  if  arbitrary  correlations  are  allowed  in  E,  then 
adding  cautious  explanations  by  a  local  closure  operation  is  not  possible.  The 
simplest  example  showing  this  contains  loops  in  the  implication  structure; 
e.g.,  let  E  be 

aDg  a' DbbDg 

g  D  c  c  Z)  b 

Let  a  and  a1  be  the  causes.  Adding  the  local  closure  g  D  a  V  b  is  insufficient, 
because  it  is  subsumed  by  g  3  c  D  b,  so  that  a  as  a  cause  of  g  will  never  be 
inferred.  Any  local  closure  for  g  cannot  find  the  connection  between  c  and  6, 
and  thus  has  the  chance  of  being  incorrect. 

Loops  in  the  implication  structure  also  cause  problems  for  other  global 
closure  methods  such  as  circumscription,  which  is  equivalent  to  Clark  com¬ 
pletion  for  the  restricted  language  [13].  In  the  case  of  the  above  example, 
minimizing  g  while  holding  the  causes  fixed  yields  g  D  6,  which  is  again 
stronger  than  the  explanatory  closure. 


6  Closure  +  Minimization  Implies  Abduc¬ 
tion 

The  closure  of  a  causal  theory  contains  the  explanatory  closure 
g  15  Aj  V  A2  V  •  •  •  V  A.n 

of  each  effect  g.  Suppose  the  closed  theory  is  consistent,  and  we  observe  g. 
Then  Ai  V  A2  V  ■  •  •  V  An  is  true  in  all  models  of  g  and  the  closed  theory.  If  we 
now  try  to  minimize  causes,  that  is,  to  assert  for  as  many  explanations  as 
possible,  we  will  eliminate  possible  explanations  from  the  disjunction,  until 
we  are  left  with  a  single  one.  Thus  we  can  perform  abductive  reasoning  in 
the  consistency  based  approach. 

There  is  one  caveat  to  this  reasoning:6  if  an  explanation  Aj  is  not  in¬ 
dependent,  then  it  will  not  be  found  by  closure  and  minimization.  Suppose 
there  is  another  A 2  that  is  implied  by  A\  and  the  domain  theory;  then  Ai  will 
be  shadowed  from  the  minimization  by  A2:  we  cannot  assert  -<A2  without 
concluding  — >/4i .  Thus  using  closure  and  minimization  will  only  produce  the 
independent  explanations. 

This  discussion  is  made  more  precise  with  the  following  proposition. 

THEOREM  6.1  Let  (C,E,?j)  be  a  simple  causal  theory,  and  suppose  that 
{C,  E,  II) ,  its  closure,  is  consistent  and  does  not.  entail  an  effect  g.  Let 
A  be  an  explanation  for  g  in  S,  and  suppose  that  A  is  consistent,  ■ with 
II  and  independent  in  II.  Then  A  is  a  subset  (riot  necessarily  proper) 
of  some  diagnosis  for  g  in  II. 

Conversely,  every  diagnosis  for  g  in  II  is  a  superset  (not  necessarily 
proper)  of  some  explanation  for  g  in  S. 

Proof.  Suppose  A  is  an  explanation  of  g  (in  E),  and  let  X  be  the 
disjunction  of  the  rest  of  g's  explanations:  A2  V  A3  V-  •  -  V  A„.  ~>X 
is  consistent  with  II,  or  else  II  j=  X  and  so  II  |=  g,  contradicting 
the  assumptions.  Also,  by  assumption,  II  U  A  is  consistent,  and 
since  A  is  independent  in  II,  II  U  A  U  -‘X  is  consistent,  and  hence 
so  is  FI  U  {g}  UrnY.  Let  m  be  a  model  of  IIU  {</}  -f-uY,  and  let  D  = 

am  indebted  to  Ennolc  Paek  for  pointing  out  this  problem. 
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{a2,  «3,  - . .  an}  be  a  set  of  elements,  one  from  each  explanation  of 
A’,  that  are  false  in  m.  Let  =  (-1  a2  A  -'a 3  A  . . .  -■a,,).  Now 
IT  U  {</}  U  ~D  is  consistent,  and  because  of  the  presence  of  the 
closure  of  g,  A  is  a  consequence  of  it.  D  can  be  extended  to  some 
maximal  set  D'  that  is  a  denial  set  for  g,  and  its  complement 
w.r.t.  C  contains  A. 

For  the  converse  part,  let  II U  {<7}  U  be  consistent  for  some 
denial  (maximal)  set  D  C  C.  Suppose  the  associated  diagnosis 
H  is  not  a  superset  of  any  explanation  of  g  in  £.  Then  for  any 
explanation  A,-  of  g ,  ->A{  is  consistent  with  II  U  {5}  U  ~D,  and  so 
D  by  maximally  must  contain  some  element  of  each  of  A,.  Thus 
~D  D  -‘Eg,  where  Eg  is  the  cautious  explanation  for  g.  This  is 
a  contradiction,  since  g  D  Eg  is  a  sentence  of  II. 

Remarks.  This  theorem  shows  the  general  correspondence  between  abduc- 
tive  and  consistency  methods.  If  an  inverse  causal  theory  is  formed  by  closing 

а.  causal  theory,  then,  with  several  restrictions,  consistency  based  diagnoses 
and  abductive  based  explanations  are  isomorphic  to  one  another.  The  re¬ 
strictions  have  to  do  with  the  problems  encountered  in  adding  closures  to  a 
causal  theory;  given  the  results  of  the  last  section,  it  may  not  be  possible  to 
do  so  consistently,  as  explanations  may  change,  and  so  forth. 

This  is  not  to  say  that  the  two  approaches  are  equivalent,  however.  The 
consistency  based  method  in  genera]  entails  more  than  the  abductive  one,  as 
a  consequence  of  adding  the  closures. 

COROLLARY  6.2  Same  conditions  as  Theorem  6.1  above.  For  some  denial 
set  D,  every  consequence  of  £  and  A  is  a  consequence  of  1 1  and  D.  On 
the  other  hand,  some  consequence  of  D  and  II  may  not  be  consequences 
of  S  and  A. 

In  Example  5.2,  -<(1.3  is  a  consequence  of  II,  but  not  of  any  abductive  expla¬ 
nation  for  g> . 

б. 1  Representational  Issues 

The  results  of  this  paper  must  be  interpreted  in  light  of  both  the  domain 
information  available  and  the  task  at  hand.  We  will  briefly  examine  three 
areas:  temporal  projection,  plan  recognition,  and  diagnosis. 
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Closure  conditions  may  be  given  directly  as  part  of  the  axiomatization 
of  a  domain.  This  is  the  case  with  many  diagnostic  tasks  in  which  complete 
fault  models  [15]  are  given,  characterizing  the  exa.ct  relation  between  the 
input/output  behavior  of  a  system  and  its  internal  states.  A  similar  analysis 
can  be  given  for  the  system  of  [8]  on  temporal  projection,  where  the  “internal 
state”  is  the  set  of  event  occurrences  and  the  axiomatization  specifies  exactly 
what  events  must  occur  given  a  sequence  of  states,  and  vice  versa. 

On  the  other  hand,  often  one  has  information  about  causal  effects,  to¬ 
gether  with  some  noncausal  correlations  (e.g.,  forbidden  states)  and  would 
like  to  generate  explanatory  diagnoses.  In  order  to  employ  the  consistency 
based  approach,  the  explanatory  closures  must  be  generated  and  added.  Here 
the  form  of  the  causation  axioms  can  be  exploited.  If  they  are  horn,  definite 
and  acyclic,  then  local  closure  (Clark  completion)  can  be  used.  For  more 
complicated  theories,  a  technique  such  as  circumscription  may  be  appropri¬ 
ate.  An  example  here  is  the  theory  of  plan  recognition  in  [5].  The  domain 
theory  is  a  hierarchical  set  of  actions;  the  causes  are  the  goals  at  the  highest 
level  of  the  hierarchy,  the  so-called  END  events.  Relations  between  actions 
at  different  levels  in  the  hierarchy  are  given  by  a  first-order  domain  theory. 
Circumscription  is  used  to  close  off  the  axioms,  producing  the  explanatory 
closure  axioms.  Given  a  set  of  observed  actions,  minimizing  over  the  END 
events  produces  an  explanation  of  the  observations.' 

Another  good  example  of  the  derivation  of  closure  axioms  is  from  the 
theory  of  temporal  projection  in  [6];  in  effect,  this  theory  is  similar  to  that 
of  [8]  above,  with  the  following  differences.  First,  the  sequence  of  actions 
is  fully  specified  by  the  result  function,  but  exceptions  to  the  actions  are 
allowed,  in  the  form  of  miracles:  these  are  the  assumable  atoms.  Second, 
there  is  a  theory  of  causation  for  action  types,  which  is  used  to  generate  the 
closure  conditions  by  circumscription.  That  is,  the  causation  axioms  state 
what  must  follow  if  the  preconditions  of  an  action  hold  and  the  action  takes 
place;  circumscription  then  generates  the  closure  axioms.  Minimization  over 
miracles  gives  the  desired  explanations. 

The  motivation  behind  the  multiple  circumscription  in  systems  such  as 
[5;  6]  has  often  been  obscure.  Given  the  results  of  this  paper,  it  should  be 
clear  that  the  circumscriptions  are  performing  abduction  by  using  closure 
and  minimization.  Whether  the  circumscription  corresponds  to  an  appropri- 


' Tli is  account  is  of  necessity  somewhat  simplified. 


Figure  2:  A  double  inverter 


ate  closure  can  be  tested  by  checking  whether  it  produces  the  explanatory 
closure  axioms,  and  whether  adding  these  axioms  changes  the  set  of  explana¬ 
tions.  The  examples  and  propositions  of  Section  5  should  be  helpful  in  this 
regard;  for  example,  by  adding  escape  causes  it  is  always  possible  to  retain 
the  original  causal  structure.  In  general,  if  there  are  cycles  in  the  implication 
structure  of  the  causal  domain  theory,  then  neither  circumscription  nor  local 
closure  will  work  correctly  in  generating  explanatory  closures. 

6.2  Logic  Based  Diagnosis 

In  logic  based  diagnosis  methods  derivative  of  [14],  the  domain  theory  takes 
on  a  restricted  form,  with  a  distinguished  set  of  abnormality  predicates  ab¬ 
used  to  describe  the  expected  behavior  of  a  system.  For  explanatory  diag¬ 
nosis,  a  complete  fault  model  is  required,  so  that  both  normal  and  abnormal 
behaviors  are  lully  specified.  For  example,  consider  the  double  inverter  of 
Figure  2.  The  domain  axioms  are 

-<abj  D  (in;  =  -iouf,-) 

a  hi  D  (in;  =  out,-)  (5) 

Olifa  =  jjlb  ■ 

Each  inverter  can  either  have  normal  behavior,  or  have  a  short  circuit  so  that 
input  and  output  are  the  same.  Let  C  =  {ah,}-  If  we  observe  {ina,  out*,}, 
then  there  is  one  diagnosis,  the  empty  set.  To  compare  this  to  the  abductive 
approach,  we  must  modify  the  definition  of  explanation  slightly,  allowing 
-'(ib;  as  nonatomic  causes. s  Let  C  =  CU{-'c|c€C},  and  define  a  default 
explanation  of  0  in  a  theory  { C,E ,  E}  to  be  an  explanation  of  O  minimal 
in  C-a.toms,  restricted  to  just  these  atoms.  In  this  example,  the  explanations 
are  {->ab„, -■«/>(,}  and  {ab,,, a/>&};  the  first,  of  these  is  minimal  in  a/>a.toms, 

8We  could  also  change  the  vocabulary  and  add  ok;  —  -mb,  as  a  new  set  of  causes,  and 
then  use  Theorem  0.1.  But.  this  would  change  the  diagnoses  in  logic  based  diagnosis,  by 
eliminating  the  distinction  between  normal  and  abnormal  behavior. 


and  the  default  explanation  is  the  empty  set.  i.e.,  the  diagnosis.  In  fact,  we 
can  show  in  general  that  the  default  explanations  correspond  to  diagnoses. 

THEOREM  6.3  Let  c  —  C  U  {->C  I  c  £  C },  and  let  (C',E,E)  be  a  simple 
causal  theory.  Suppose  II,  the  explanatory  closure  of  E,  is  consistent 
and  does  not  entail  an  effect  g.  Let  A  be  a  default  explanation  for  g 
in  S,  and  suppose  that  A  is  consistent  with  II  and  independent  in  II. 
Then  A  is  a  subset  (not  necessarily  proper)  of  some  diagnosis  for  g  in 

(c,e,  n). 

Conversely,  every  diagnosis  for  g  in  (C,E,U)  is  a  superset  (not  neces¬ 
sarily  proper )  of  some  default  explanation  for  g  in  (C',E,  £).9 

Proof.  Consider  the  theory  IT  formed  from  II  by  adding  c  =  ->c 
for  each  c  £  C.  The  diagnoses  of  { C,E ,  II)  are  just  the  diagnoses 
of  (Cufc'},  E,  II')  that  are  minimal  in  C,  keeping  only  atoms  in  C. 
Similarly,  default  explanations  of  (C',E,  S)  are  just  explanations 
of  (C  U  {c'),E,  S')  that  are  minimal  in  C,  keeping  only  atoms 
in  C.  The  result  follows  from  the  connection  between  diagnoses 
of  ( C  U  {c'},£,  II')  and  explanations  of  (C  U  {c'},  E ,  S')  given  by 
Theorem  6.1. 

In  general,  diagnoses  assume  more  than  is  really  needed  for  an  explana¬ 
tion:  hence  the  necessity  of  subset/superset  relations  in  the  correspondence 
theorems.  As  an  example,  consider  the  three  unconnected  inverters  of  Figure 
3.  Suppose  the  basic  axioms  governing  the  inverters  are  the  same  as  before; 
there  are  no  connections  between  the  inverters,  and  the  faults  are  coupled 
by  the  axiom  aba  3  (a b<,  V  ahc).  If  we  observe  {in0, ouff,},  there  are  two 
diagnoses,  {aba,  abb,  }  and  {aba,abc}.  There  is  only  one  default  explanation, 
namely  {ab„}.  The  diagnoses  all  give  a.  complete  state  of  the  system,  whereas 
the  default  explanation  is  only  the  set  of  abnormal  causes  that  account  for 
the  observed  behavior.  The  abductive  approach  distinguishes  between  direct 
causes  of  the  observations  and  irrelevant  causes,  while  the  consistency  based 
approach  does  not. 


9 While  default,  explanations  correspond  lo  diagnoses,  explanations  themselves  corre¬ 
spond  to  the  kernel  diagnoses  of  [3],  which  cover  the  set  of  possible  diagnoses.  We  do  not 
prove  this  result  here  because  of  space  limitations. 
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Figure  3:  An  unconnected  circuit 

7  Conclusion 

We  have  shown  how  to  extend  the  correspondence  between  abductive  and 
consistency  based  methods  t.o  the  case  of  causal  theories  that  have  arbitrary 
first-order  relations  between  causes  and  effects.  The  correspondence  requires 
that  a  domain  theory  expressing  how  causes  produce  effects  be  dosed,  that 
is,  contain  statements  that  the  only  causes  are  the  known  ones.  The  ap¬ 
propriate  closure  axioms  are  identified  in  this  paper  as  explanatory  closures. 
The  main  result  of  the  paper  is  that  minimization  of  causes  in  the  closed 
theory  produces  almost  the  same  explanations  as  abduction  in  the  original 
causal  theory.  The  caveat  is  that  the  abductive  explanations  are  generally 
weaker  than  their  consistency  based  counterparts.  There  are  two  reasons  for 
this:  adding  closures  may  change  the  available  explanations;  and  the  consis¬ 
tency  based  method  can  conclude  causes  that  are  intuitively  irrelevant  to  the 
observed  behavior. 

If  one  is  interested  in  the  representation  of  domain  knowledge,  then  the 
abductive  approach  offers  several  advantages.  It  does  not  require  the  as¬ 
sumption  of  complete  knowledge  of  causation,  and  it  is  not  necessary  to  as¬ 
sert  the  explanatory  closures.  Adding  the  closures  can  lead  to  inconsistency 
and  change  the  available  explanations  (although  it  will  not  add  new  ones). 
The  computational  aspect  of  adding  closures  is  also  discouraging,  since  there 
is  no  general  local  method  that  accomplishes  the  addition.  Stronger  global 


methods  such  as  circumscription  will  work  only  in  special  circumstances. 
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